MAGNET: A Tool for Debugging, Analyzing and Adapting Computing Systems

نویسندگان

  • Mark K. Gardner
  • Wu-chun Feng
  • Michael Broxton
  • Adam Engelhart
  • Justin Gus Hurwitz
چکیده

As computing systems grow in complexity, the cluster and grid communities require more sophisticated tools to diagnose, debug and analyze such systems. We have developed a toolkit called MAGNET (Monitoring Apparatus for General kerNel-Event Tracing) that provides a detailed look at operating-system kernel events with very low overhead. Using the fine-grained information that MAGNET exports from kernel space, challenging problems become amenable to identification and correction. In this paper, we first present the design, implementation and evaluation of MAGNET. Then, we show its use as a diagnostic tool, an online-monitoring tool and a tool for building adaptive applications in clusters and grids.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Title: Engineering Synthetic Trans-splicing Ribozyme Systems

Natural intron-like self-splicing ribozymes have been re-engineered to trans-splice two arbitrary RNA pieces together. This capability has potential to be tremendously useful for the synthetic biologist. We propose analyzing the suitability of these ribozymes as a tool for engineering biology by adapting trans-splicing ribozymes for use in measuring, debugging, patching, and building biological...

متن کامل

A High-Performance Sensor for Cluster Monitoring and Adaptation

As Beowulf clusters have grown in size and complexity, the task of monitoring the performance, status, and health of such clusters has become increasingly more difficult but also more important. Consequently, tools such as Ganglia and Supermon have emerged in recent years to provide the robust support needed for scalable cluster monitoring. However, the scalability comes at the expense of accur...

متن کامل

Using Complete System Simulation for Temporal Debugging of General Purpose Operating Systems and Workloads

Digital convergence is precipitating the addition of soft real-time applications to mainstream desktop and server operating environments. Most traditional debuggers for mainstream systems lack a notion of temporal correctness, making them unsuitable for real-time system design and analysis. We propose leveraging complete system simulation to build a temporal debugger capable of analyzing mixed ...

متن کامل

Using Complete System Simulation for Temporal Debugging of General Purpose Operating Systems and Workload

Digital convergence is precipitating the addition of soft real-time applications to mainstream desktop and server operating environments. Most traditional debuggers for mainstream systems lack a notion of temporal correctness, making them unsuitable for real-time system design and analysis. We propose leveraging complete system simulation to build a temporal debugger capable of analyzing mixed ...

متن کامل

Improving the Resilience of Military Hospitals Through Self-Adaptation of Hospital Systems Using Organic Computing

Background and Aim: Among the failures of a disaster, the disruption of the critical infrastructure of the community causes the most damage to society. Therefore, the ability of critical infrastructure such as hospitals to anticipate, absorb, adapt or rapidly recover from a devastating event is essential. The purpose of this study is to design a self-adaptive model for resilient hospital system...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003